Context-Free TextSpotter for Real-Time and Mobile End-to-End Text Detection and Recognition
Abstract
In the deployment of scene-text spotting systems on mobile platforms, lightweight models with low computation are preferable. In concept, end-to-end (E2E) text spotting is suitable for such purposes because it performs text detection and recognition in a single model. However, current state-of-the-art E2E methods rely on heavy feature extractors, recurrent sequence modellings, and complex shape aligners to pursue accuracy, which means their computations are still heavy. We explore the opposite direction: How far can we go without bells and whistles in E2E text spotting? To this end, we propose a text-spotting method that consists of simple convolutions and a few post-processes, named Context-Free TextSpotter. Experiments using standard benchmarks show that Context-Free TextSpotter achieves real-time text spotting on a GPU with only three million parameters, which is the smallest and fastest among existing deep text spotters, with an acceptable transcription quality degradation compared to heavier ones. Further, we demonstrate that our text spotter can run on a smartphone with affordable latency, which is valuable for building stand-alone OCR applications.
Similar resources
End-To-End Face Detection and Recognition
Plenty of face detection and recognition methods have been proposed and got delightful results in decades. Common face recognition pipeline consists of: 1) face detection, 2) face alignment, 3) feature extraction, 4) similarity calculation, which are separated and independent from each other. The separated face analyzing stages lead the model redundant calculation and are hard for end-to-end tr...
SEE: Towards Semi-Supervised End-to-End Scene Text Recognition
Detecting and recognizing text in natural scene images is a challenging, yet not completely solved task. In recent years several new systems that try to solve at least one of the two sub-tasks (text detection and text recognition) have been proposed. In this paper we present SEE, a step towards semi-supervised neural networks for scene text detection and recognition, that can be optimized end-t...
End-to-End Text Recognition with Hybrid HMM Maxout Models
The problem of detecting and recognizing text in natural scenes has proved to be more challenging than its counterpart in documents, with most of the previous work focusing on a single part of the problem. In this work, we propose new solutions to the character and word recognition problems and then show how to combine these solutions in an end-to-end text-recognition system. We do so by levera...
End-to-end Window-Constrained Scheduling for Real-Time Communication
This paper extends our original work on window-constrained scheduling, to address the problem of meeting end-to-end service guarantees across a sequence of servers. We describe an algorithm, called Multi-hop Virtual Deadline Scheduling (MVDS), that attempts to minimize end-to-end window-constraint violations, while maximizing link utilization for a series of real-time streams. The challenge pos...
Jejunal Eversion Mucosectomy and Invagination: An Innovative Technique for the End to End Pancreaticojejunostomy
ABSTRACT Background: The pancreatojejunostomy has notoriously been known to carry a high rate of operative complications, morbidity and mortality, mainly due to anastomotic leak and ensuing septic complications. Objective: In order to decrease anastomotic leak and its attendant morbidity and mortality in operations requiring a pancreato-jejunal anastomosis, and also in order to simplify the op...
Journal
Journal title: Lecture Notes in Computer Science
Year: 2021
ISSN: 1611-3349, 0302-9743
DOI: https://doi.org/10.1007/978-3-030-86331-9_16